Deriving Landauer's erasure principle from statistical mechanics 
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Landauer's erasure principle [lj states that whenever a 
single bit of information is erased, the entropy in the envi- 
ronment to which the information storing system is con- 
nected must increase by at least fcln2, where k is Boltz- 
mann's constant. (This principle is also often stated by 
saying that the energy of the environmcnt must increase 
by kT In 2, which is equivalent to the former statement for 
a canonical reservoir at temperature T.) Landauer's era- 
sure principle is an important tool in understanding ther- 
modynamics, particularly Maxwell's demon |2|. y|. |j. |tj . 
and in understanding the relationship between thermo- 
dynamics and classical and quantum information the- 
ory 0, Q • A comprehensive review of the literature on 
Landauer's erasure and Maxwell's demon may be found 
in 0. 

Here we present a concise, but rigorous derivation of 
Landauer's erasure principle from the postulates of sta- 
tistical mechanics, along with small set of other funda- 
mental postulates, all of which are uncontroversial. Cer- 
tainly it has been discussed previously how Landauder's 
principle follows from the second law of thermodynam- 
ics 0,12 B El EU ) and the present treatment is offered 
primarily in the hope that it is more transparent than 
its predecessors. Previous mathematical derivations of 
Landauer's pr i ncip le have usually focussed on example 
systems 0, 0, Il3| . While the treatment in 0] does ap- 
ply to a wide class of systems, it is simpler to proceed 
directly from the bàsic postulates of statistical mechan- 
ics, and the resulting derivation is no less rigorous. It 
is hoped that this treatment will at the least provide a 
uscful pedagogical aid, and make it crystal clear that 
Landauer's principle is as general as those of statistical 
mechanics. If one wishes to argue against the applicabil- 
ity of Landauer's principle, one must argue against the 
applicability of at least one of the postulates of statistical 
mechanics. 

Before we begin we must define what is meant by "eras- 
ing the information" contained in a system S. What is 
meant is a process by which all the possible states of S 
are mapped to a single state. Thus, after such a process, 
one no longer knows what state the system was in before, 
and the information has therefore been erased. 

It is important to understand that to perform such an 
erasure operation we must ultimately use a process which 



is invertible — that is, we must use a process that is log- 
ically reversible. This is because the fundamental laws of 
physics are reversible, and as such reversible operations 
are the only operations that are available to us. (Quan- 
tum mechanically this means that the erasure operation 
must be performed with a unitary transformation.) How- 
ever, as stated this would appear to lead a contradiction, 
since the defmition of erasure we have just given is that 
two or more states are mapped to a single state, a oper- 
ation which is clearly not invertible. What we actually 
mean by erasure then is the following: we apply a re- 
versible operation which takes the many initial states of 
the system and maps them to different microstates of an 
environment so that these states are no longer accessible 
to us. Having achieved this, the system is left in a sin- 
gle state, and since we have no access to the microstates 
of the environment, we no longer have any information 
about the initial state of the system. As a result we say 
that the information has been erased. The use of an en- 
vironment is thus essential for the process of erasure. 

To begin with we present a simple argument, which 
while persuasive is nevertheless insumcient to derive Lan- 
dauer's principle. (To make this argument concise I as- 
sume a knowledge of quantum mechanics. Readers un- 
familiar with quantum mechanics may skip this para- 
graph.). Consider the physical system containing the in- 
formation we wish to erase, 5, to be described by the 
density matrix p. The amount of information which S 
contains is given by the von Neumann entropy S v i\r(p)- 
If we erase the information in S by interacting the system 
with an environment, £ , then we must employ a unitary 
operation on the joint state-space of S and E. The sys- 
tem and environment are initially separate, and a result 
the entropy of the joint system is simply S v n(p) + S v n(o~) 
where a is the state of the environment. Since the uni- 
tary interaction cannot change the entropy of the total 
system, and the final entropy of S is zero (because it is in 
the single pure state (|0)), the von Neumann entropy of 
the environment must have increased by S v n(p), which 
is the amount of the information which has been erased. 
Since the thermodynamic entropy is given by S — kS V N, 
where k is Boltzmann's constant, the entropy of the envi- 
ronment must increase by kS V N(p)- If the system initially 
contains one bit of information, then S v n(p) = ln 2, so 



2 



that the entropy of the environment increases by fcln2. 

The above argument requires that we equate the von 
Neumann entropy with the thermodynamic entropy, and 
will thus leave many people uneasy. However, the argu- 
ment is incomplctc for another reason. It implies that 
the increase in entropy of the environment depends on 
the initial state of the system. In fact, the entropy in 
crease has nothing to do with the initial state of the sys- 
tem p 0, 0, El ■ It is actually a result of the fact that 
the transformation which achieves the erasure is non- 
invertible. In particular, S can be in a pure state, and 
the increase in the entropy of the environment is still 
fclnM, where M is the dimension of the system. This is 
clear because if the system starts in a mixed state, say an 
equal probability of being in the two states |0) or |1), this 
simply means that the system is randomly prepared in 
state |0) or |1). Thus on any given run the system has a 
definitc state, and one would expect Landauer's principio 
to hold true for every run. 

To derive Landauer's principle, and in doing so see that 
the entropy gain of the reservoir is independent of the ini- 
tial state of the system (i.e., independent of what infor- 
mation the system holds) , we will need to use one of the 
postulates of statistical mechanics. Statistical mechanics 
arose out of the observation that once the macroscopic 
properties of a system are specified (that is, the macro- 
scopically observable state- variables such as energy, pres- 
sure etc), there are very many possible physical states 
that that system could be in. Two of the postulates 
of statistical mechanics are 1. That each of the possible 
physical states, being those which are consistent with the 
macroscopic properties of the system) are equally likely, 
and 2. That the thermodynamic entropy is given by 
fclniV, where N is the total number of possible physi- 
cal states [ÏJ| . It is the second of these which will lead to 
Landauer's principle. We quote postulate 1 merely be- 
cause it shows the connection between information the- 
oretic entropy and statistical mechanical entropy: if all 
the microstates are equally likely, then the information 
theoretic entropy of the system is ln TV. However, this 
connection is not required in what follows. 

We will stick to classical systems in the following 
derivation because the use of quantum systems merely 
complicates the language without adding anything im- 
portant. Without loss of generality we will take our sys- 
tem to have two states, and label these states with the 
binary digits and 1. The environment, or reservoir, has 
a great many microstates. We will label each of these 
states with a unique number, and write this number in 
trinary. Thus each state will be represented as a long 
string of trinary digits, each digit having the value 0, 1 
or 2. (Trinary is not essential, but will be convenient.) 

At the start of the process the system may be in either 
of its two possible states. We will give the environment 
an initial entropy of fclniV. This means that, while the 
environment does exist in some definite microstate, there 
are N — 1 other microstates whi ch g ive the same value for 
its macroscopic state- variables [Ta]- Thus, at the start of 



the procedure the environment is allowed to be in any 
one of a set of N possible states. Considering our trinary 
representation for each of these states, some of the trinary 
digits (trits) will be the same for all the N states, but 
the value of many of thcm will dcpcnd upon which of the 
N states the system is in. 

As explained above, to perform the erasure of the sys- 
tem, one effects a reversible transformation coupling the 
system to the environment. Further, this reversible trans- 
formation is fixcd - that is, it is independent of the initial 
state of the system, and independent of which of the N 
states the environment is in. The reason for this will be 
explained later. Recali that the reversibility of the oper- 
ation implies that it must be possible to determine the 
original state of the system and environment from the 
final state - put another way, the transformation must 
be invertible. 

Now we apply the erasure condition, which states that 
the system must be in the state after the process. Com- 
bining this with the fact that the operation must be in- 
vertible, means that the environment must record which 
state the system was initially in, since the system does 
not. This can be done by mapping one of the trits of the 
environment to state if the system was in state 0, and 1 
if it was in state 1. However, now the initial state of the 
trit must also be recorded somewhere for the process to 
be invertible. If we record the state of this trit in another 
trit, we are again faced with the same problem. We can- 
not store the state of the trit in the system, because the 
final state of the system must be 0. The consequence of 
all this is that the initial state of the trit in which we will 
store the initial state of the system must be fixed at the 
start so that we do not need to store its value anywhere. 
We will take the initial value of this trit to be 2. 

To sum up the above discussion, we see that the process 
of erasure takes the system to a fixed state, and one of 
the trits which was initially in a fixed state, to one of 
two states. (One can also imagine that two trits are used 
instead, or that an initially fixed trit is used to store 
the initial state of a different trit which is altered, but 
the result is the same.) At the end of the procedure, the 
environment is now in one of 27V possible states, precisely 
because a trit that was initially fixed, is now in one of 
two states. 

Finally, to obtain Landauer's erasure principle, we ap- 
ply the erasure condition again: If the information in the 
system is to be erased, then the macroscopic observables 
must not dcpcnd on the initial state of the system (other- 
wise we would still have access to the information we are 
trying to erase). Thus, the macroscopic state- variables 
must be the same for all of these 2N states. Each of 
these states is thus a member of a set of at least 2N 
states for which the macroscopic state-variables are the 
same. As a result, the final entropy of the environment 
is at least kln(2N) = klnN + fcln2, and has therefore 
increased by at least ATn2. 

Since the final states are members of a set of 2N states 
for which the macroscopic state-variables are the same, 
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whereas the initial states are members of a set of just 
N such states, the final states must all be differcnt from 
the initial states. It is now clear why we chose the initial 
state of the trit which is altered as 2, and the final states 
as and 1. If we had chosen the final states to be 1 and 2, 
respectivcly, so that the state of the environment was not 
altered by the erasure when the system started in the 
state, then the following would be true: When the system 
started in the zero state, the environment would remain 
in the set of N states for which the macroscopic state- 
variables have their initial vàlues. However, when the 
system started in the 1 state, the state of the environment 
would be mapped outside this set of states. As a result, 
the final vàlues of the macroscopic state variables would 
record the initial state of the system, and the information 
would not have been erased. 

It is now clear that regardless of the initial state of the 
system, the entropy of the environment must increase, 
and what is more the minimum increase is the same for 
all initial states. 

In the above derivation we employed the fact that the 
reversible transformation used to erase the information is 
fixed, by which we mean that it cannot depend upon the 
initial state of the system or the environment. We now 
cxplain why this is the case. Most fundamentally this 
follows from the assumption that the universe is evolv- 
ing according to the laws of physics, and thus obeying 
an evolution due to a single reversible transformation. 
Thus there is only one transformation which does every- 
thing in the universe, including, necessarily, erasing the 
information in our system. 

However, from everyday experience we know that we 
could, if we chose, perform one of two unitary transforma- 
tions depending on the initial state of the system (we use 
unitary transformations here because we assume the that 
universe in quantum mechanical). It is thereforc worth 
examining how this is donc. It must be accomplishcd 



with a single unitary, and as result we require is a single 
unitary in a larger state space, which contains the two 
unitàries as sub-matrices. Each of the sub-matrices is an 
operator which acts on our system, and so is two dimen- 
sional. The full unitary thereforc is four dimensional as it 
contains two of these operators. It therefore acts in a four 
dimensional space, and we choose this to be the tensor 
product of two two-state systems. It is the state of the 
second "auxiliary" system which determines which uni- 
tary will act on our system. We wish this to depend on 
the state of our system, so we perform a unitary between 
our system and the auxiliary system which correlates the 
two - that is, sets the auxiliary to |1) of the system is 
in state |1), etc. Then we evolve the single unitary op- 
eration, which enacts either of the two unitàries on our 
system, and is thus able to prepare the system in state 
|0) regardless of its initial state. So what's the catch? 
The catch is that the auxiliary system now records the 
initial state of our system, and the information has there- 
fore not been erased. To put this more simply another 
way, if we use two unitàries to "erase" the state of the 
system, as we know we can, then we know which unitary 
we used and therefore the information has not, in fact, 
been erased. 

To sum up the above analysis, Landauer's erasure prin- 
cipio follows from four bàsic postulates: the definition of 
erasure, the statistical mechanical definition of entropy, 
the necessity of using a reversible transformation to per- 
form the erasure, and finally the fact that the unitary 
operation which is used must be independent of the state 
of the system. 
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